A universal quantum circuit for two-qubit transformations 
with three CNOT gates 
We consider the implementation of two-qubit unitary transformations by means of CNOT gates 
and single-qubit unitary gates. We show, by means of an explicit quantum circuit, that together 
with local gates three CNOT gates are necessary and sufficient in order to implement an arbitrary 
unitary transformation of two qubits. We also identify the subset of two-qubit gates that can be 
performed with only two CNOT gates. 
In the context of establishing the existence of universal
sets of two-qubit gates for quantum computation,
Barenco et al. [2] showed that any unitary transformation
on n qubits can be decomposed into a sequence of
CNOT and single-qubit gates. Since then it has become
customary to express n-qubit unitary transformations,
associated e.g. with quantum algorithms, as a series
of CNOT and single-qubit gates in the quantum circuit
model.

Relatedly, in those experimental settings where significant
single-qubit control is already available, the ability
to reliably perform a CNOT gate has become the standard
hallmark of multi-qubit control. As a consequence,
achieving a CNOT gate is one of the most popular and
coveted goals among quantum information experimentalists.
In turn, this has triggered theoretical studies on
the optimal use of two-qubit interactions and of entangling
gates to perform a CNOT gate.

In this paper we consider the construction of quantum
circuits that minimize the use of CNOT gates. Such optimal
constructions are of significant interest in at least
two contexts. First, they play a role in determining the
algorithmic complexity of a given quantum computation,
that is, the number of elementary gates required to implement
the corresponding n-qubit unitary evolution. A
most remarkable result of Barenco et al. [2] is the explicit
decomposition of an arbitrary $U \in \mathcal{U}(2^n)$ as a sequence
of CNOT and single-qubit gates. This general construction,
however, unavoidably requires exp(n) CNOT gates, which
renders the resulting quantum circuit inefficient. We recall,
on the other hand, that the n-qubit unitary transformations
relevant for quantum computation are precisely
those that can be decomposed into only poly(n) elementary
gates for large n. Thus, given a unitary transformation
$U \in \mathcal{U}(2^n)$ of interest, it is important to know how
many CNOT gates are required to implement it.

Algorithmic complexity is typically concerned with
gates involving a large number of qubits. But quantum
circuits that minimize the use of CNOT gates are also
of interest for gates involving only a small number of
qubits, for a very practical reason: in present-day experiments,
two-qubit gates such as the CNOT gate are imperfect
due to technological limitations. Therefore, in order to
minimize the probability that an error occurs in performing
a certain unitary evolution $U \in \mathcal{U}(2^n)$, it is instrumental
that the number of times the qubits interact is
as small as possible. Unfortunately, it is not known in
general how to optimally decompose $U$ into CNOT and
single-qubit gates, not even for a small number of qubits,
nor how to determine the minimal number of required
CNOT gates.

In this paper we describe a universal quantum circuit
for two-qubit unitary transformations $U \in SU(4)$ consisting
of only three CNOT gates and four rounds of local
gates. The shortest circuit previously known requires
four CNOT gates. We also show that three CNOT
gates are required in order to perform a generic two-qubit
gate, thereby establishing the optimality of the proposed
universal quantum circuit. Finally, we characterize the
subset of two-qubit gates that require only two CNOT
gates and construct an alternative, smaller quantum circuit
for them.

We consider two qubits, labeled A and B, and an arbitrary
unitary transformation $U \in SU(4)$. Let $u_i, v_i \in SU(2)$
denote single-qubit unitary gates acting, respectively,
on qubits A and B, and let $U_{\rm CNOT}$ denote a CNOT
gate that has qubit A as control and qubit B as target,

$$U_{\rm CNOT}\, |m\rangle_A \otimes |n\rangle_B = |m\rangle_A \otimes |n \oplus m\rangle_B, \qquad m, n = 0, 1, \tag{1}$$

where $i \oplus j$ denotes sum modulo 2 and $|0\rangle$ ($|1\rangle$) is the
eigenvector with eigenvalue 1 ($-1$) of the third of the
Pauli matrices, $\sigma_z$.
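As a quick numerical illustration of Eq. (1), the following minimal sketch (ours, in Python with NumPy) builds the CNOT matrix with qubit A as control, assuming the basis ordering $|00\rangle, |01\rangle, |10\rangle, |11\rangle$. The names `CNOT`, `ket` and `cnot_matches_eq1` are hypothetical helpers introduced only for this example.

```python
import numpy as np

# CNOT with qubit A (first tensor factor) as control and qubit B as target,
# written in the assumed basis ordering |00>, |01>, |10>, |11>.
CNOT = np.array([[1, 0, 0, 0],
                 [0, 1, 0, 0],
                 [0, 0, 0, 1],
                 [0, 0, 1, 0]], dtype=complex)


def ket(m, n):
    """Computational basis vector |m>_A (x) |n>_B as a length-4 array."""
    v = np.zeros(4, dtype=complex)
    v[2 * m + n] = 1.0
    return v


def cnot_matches_eq1():
    """Check U_CNOT |m>|n> = |m>|n XOR m> for all m, n in {0, 1}."""
    return all(np.allclose(CNOT @ ket(m, n), ket(m, n ^ m))
               for m in (0, 1) for n in (0, 1))


if __name__ == "__main__":
    print(cnot_matches_eq1())
    # The determinant shows whether the matrix belongs to SU(4).
    print(np.linalg.det(CNOT).real)
```

The determinant printed at the end is the quantity that decides whether a gate lies in SU(4) or only in U(4).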
Theorem 1. An arbitrary unitary gate $U \in SU(4)$
can be decomposed in terms of three CNOT gates and
single-qubit unitary gates $u_i, v_i$ (to be specified below) as
An important element in the proof of theorem 1 is
the decomposition of $U \in SU(4)$ derived by Khaneja et
al. and Kraus et al., namely

$$U = (u_4' \otimes v_4')\, e^{-iH}\, (u_1 \otimes v_1), \tag{4}$$

$$H = h_x\, \sigma_x \otimes \sigma_x + h_y\, \sigma_y \otimes \sigma_y + h_z\, \sigma_z \otimes \sigma_z, \tag{5}$$

where $\pi/4 \geq h_x \geq h_y \geq |h_z|$. An explicit protocol to
extract the single-qubit gates $u_1, v_1, u_4', v_4' \in SU(2)$ and
the coefficients $h_x, h_y, h_z \in \mathbb{R}$ from $U$ was presented in
Ref. [10]. In what follows we show that $e^{-iH}$ can be
further decomposed as
where

$u_2 = \ldots$, $v_2 = e^{-i \ldots}$, (7)

$u_3 = \ldots$
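so that $u_4$ and $v_4$ in Eq. (3) are

$$u_4 = u_4'\, w, \qquad v_4 = v_4'\, w^{\dagger}. \tag{10}$$

Let us introduce the Bell basis

$$|\gamma_{00}\rangle \equiv \tfrac{1}{\sqrt{2}}(|00\rangle + |11\rangle), \qquad |\gamma_{01}\rangle \equiv \tfrac{1}{\sqrt{2}}(|01\rangle + |10\rangle),$$
$$|\gamma_{10}\rangle \equiv \tfrac{1}{\sqrt{2}}(|00\rangle - |11\rangle), \qquad |\gamma_{11}\rangle \equiv \tfrac{1}{\sqrt{2}}(|01\rangle - |10\rangle),$$

where $|mn\rangle$ denotes $|m\rangle_A \otimes |n\rangle_B$. Operator $H$ in Eq.
(5) can be rewritten as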



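$$H = \sum_{m,n=0}^{1} \lambda_{mn}\, |\gamma_{mn}\rangle\langle\gamma_{mn}|, \tag{11}$$

with $\lambda_{mn}$ defined as

$$\lambda_{00} = h_x - h_y + h_z, \qquad \lambda_{01} = h_x + h_y - h_z, \tag{12}$$
$$\lambda_{10} = -h_x + h_y + h_z, \qquad \lambda_{11} = -h_x - h_y - h_z. \tag{13}$$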
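The spectral form of $H$ can be checked numerically. The sketch below (ours, Python with NumPy) assumes the standard Pauli matrices and the basis ordering $|00\rangle, |01\rangle, |10\rangle, |11\rangle$, and verifies that each Bell vector is an eigenvector of $H$ with the eigenvalue listed in Eqs. (12)-(13). The helper names `hamiltonian`, `bell` and `eigenvalues` are hypothetical.

```python
import numpy as np

X = np.array([[0, 1], [1, 0]], dtype=complex)
Y = np.array([[0, -1j], [1j, 0]], dtype=complex)
Z = np.array([[1, 0], [0, -1]], dtype=complex)


def hamiltonian(hx, hy, hz):
    """H = hx XX + hy YY + hz ZZ as a 4x4 matrix (qubit A is the first factor)."""
    return hx * np.kron(X, X) + hy * np.kron(Y, Y) + hz * np.kron(Z, Z)


def bell(m, n):
    """Bell vector |gamma_mn>: n picks {|00>,|11>} or {|01>,|10>}, m the sign."""
    v = np.zeros(4, dtype=complex)
    sign = 1 - 2 * m
    if n == 0:
        v[0], v[3] = 1, sign      # |00> +/- |11>
    else:
        v[1], v[2] = 1, sign      # |01> +/- |10>
    return v / np.sqrt(2)


def eigenvalues(hx, hy, hz):
    """The four lambda_mn written exactly as in Eqs. (12)-(13)."""
    return {(0, 0): hx - hy + hz,
            (0, 1): hx + hy - hz,
            (1, 0): -hx + hy + hz,
            (1, 1): -hx - hy - hz}


def bell_basis_diagonalizes(hx, hy, hz):
    """True if H |gamma_mn> = lambda_mn |gamma_mn> for all m, n."""
    H = hamiltonian(hx, hy, hz)
    lam = eigenvalues(hx, hy, hz)
    return all(np.allclose(H @ bell(m, n), lam[(m, n)] * bell(m, n))
               for m in (0, 1) for n in (0, 1))


if __name__ == "__main__":
    print(bell_basis_diagonalizes(0.6, 0.3, -0.1))
```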

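Then $e^{-iH}$ becomes

$$e^{-iH} = \sum_{m,n=0}^{1} e^{-i\lambda_{mn}}\, |\gamma_{mn}\rangle\langle\gamma_{mn}|. \tag{14}$$

Direct inspection shows that circuit (6) indeed acts on
the Bell basis $|\gamma_{mn}\rangle$ as [11]

$$|\gamma_{mn}\rangle \rightarrow e^{-i\lambda_{mn}}\, |\gamma_{mn}\rangle, \tag{15}$$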



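thereby proving the theorem. We include some of the
details. The first (leftmost) CNOT in (6) maps the Bell
basis into a product basis, namely

$$|\gamma_{mn}\rangle \rightarrow |\bar{m}\rangle_A \otimes |n\rangle_B, \tag{16}$$

where $|\bar{0}\rangle \equiv (|0\rangle + |1\rangle)/\sqrt{2}$ and $|\bar{1}\rangle \equiv (|0\rangle - |1\rangle)/\sqrt{2}$.
The local transformation $u_2 \otimes v_2$ introduces convenient
phases $\phi_{mn}(h_x, h_z)$ into this local basis, and maps it into
a new product basis,

$$|\bar{m}\rangle_A \otimes |n\rangle_B \rightarrow e^{-i\phi_{mn}(h_x, h_z)}\, |m\rangle_A \otimes |n\rangle_B. \tag{17}$$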



The second CNOT gate exchanges only two elements of
the new product basis (recall Eq. (1)),

$$|1\rangle_A \otimes |0\rangle_B \leftrightarrow |1\rangle_A \otimes |1\rangle_B, \tag{18}$$

after which $u_3 \otimes v_3$ switches back to the $|\bar{m}\rangle_A \otimes |n\rangle_B$
basis and introduces more phases $\phi'_{mn}(h_y)$. The last (rightmost)
CNOT in (6) maps the latter product basis back into the
original Bell basis,

$$|\bar{m}\rangle_A \otimes |n\rangle_B \rightarrow |\gamma_{mn}\rangle, \tag{19}$$

and the final local gates $w \otimes w^{\dagger}$ exchange vectors $|\gamma_{10}\rangle$
and $|\gamma_{11}\rangle$ in order to undo the permutation (18) [and
also add a $\pi/4$ phase to each of them], so that circuit (6)
implements transformation (15).
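The steps of this proof can be reproduced numerically. The sketch below (ours, Python with NumPy) assembles one three-CNOT circuit that follows the sequence described above: CNOT, local phases, CNOT, local phases, CNOT, and a final pair of local gates. The specific single-qubit gates in `three_cnot_circuit` are our own illustrative choice, derived to make the check pass; they are an assumption and are not claimed to coincide with the gates $u_2, v_2, u_3, v_3, w$ of the original equations. With this choice the circuit equals $e^{-iH}$ up to a global phase, taken here as $e^{i\pi/4}$.

```python
import numpy as np

X = np.array([[0, 1], [1, 0]], dtype=complex)
Y = np.array([[0, -1j], [1j, 0]], dtype=complex)
Z = np.array([[1, 0], [0, -1]], dtype=complex)
HAD = (X + Z) / np.sqrt(2)          # Hadamard gate, maps |+>,|-> to |0>,|1>
CNOT = np.array([[1, 0, 0, 0],
                 [0, 1, 0, 0],
                 [0, 0, 0, 1],
                 [0, 0, 1, 0]], dtype=complex)   # qubit A controls qubit B


def rot(P, theta):
    """exp(-i theta P) for any matrix P that squares to the identity."""
    dim = P.shape[0]
    return np.cos(theta) * np.eye(dim, dtype=complex) - 1j * np.sin(theta) * P


def exp_minus_iH(hx, hy, hz):
    """exp(-iH); XX, YY and ZZ commute, so the exponential factorizes."""
    return (rot(np.kron(X, X), hx) @ rot(np.kron(Y, Y), hy)
            @ rot(np.kron(Z, Z), hz))


def three_cnot_circuit(hx, hy, hz):
    """Illustrative three-CNOT circuit (assumed gates, see lead-in)."""
    u2 = HAD @ rot(X, hx - np.pi / 4)   # phases depending on hx, then basis change
    v2 = rot(Z, hz)                     # phases depending on hz
    u3 = HAD                            # switch back to the |+>,|-> basis on A
    v3 = rot(Z, -hy)                    # phases depending on hy
    w = rot(X, np.pi / 4)               # final local gate, w = exp(-i pi/4 X)
    # Matrix products act right to left: the rightmost factor is applied first.
    return (np.kron(w, w.conj().T) @ CNOT @ np.kron(u3, v3) @ CNOT
            @ np.kron(u2, v2) @ CNOT)


def circuit_matches(hx, hy, hz):
    """Compare the circuit with exp(-iH) up to the global phase exp(i pi/4)."""
    target = np.exp(1j * np.pi / 4) * exp_minus_iH(hx, hy, hz)
    return np.allclose(three_cnot_circuit(hx, hy, hz), target)


if __name__ == "__main__":
    print(circuit_matches(0.7, 0.4, -0.2))
```

Only three applications of `CNOT` appear in the product; every other factor is a tensor product of single-qubit gates.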

As shown in Ref. [2], a nontrivial subset of two-qubit unitary
transformations, namely control-$V$ transformations
for $V \in \mathcal{U}(2)$, can be performed by using only two CNOT
gates and single-qubit gates. For these gates, one finds
$h_y = h_z = 0$ in the decomposition (4)-(5), so that they
are locally equivalent to a control-phase gate $U_\varphi$,
for an arbitrary phase $\varphi$. Theorem 2 characterizes the
set of two-qubit transformations that can be performed
with only two CNOT gates. They correspond to $h_z = 0$
in (4)-(5), and are therefore a subset of zero measure in
the space of two-qubit gates.
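For concreteness, a control-phase gate can be built from two CNOT gates and single-qubit phase gates. The sketch below (ours, Python with NumPy) uses a standard textbook construction and assumes the convention $U_\varphi = \mathrm{diag}(1, 1, 1, e^{i\varphi})$, which may differ from the paper's own definition by local gates or a global phase. The names `phase_gate`, `control_phase` and `two_cnot_control_phase` are hypothetical.

```python
import numpy as np

I2 = np.eye(2, dtype=complex)
CNOT = np.array([[1, 0, 0, 0],
                 [0, 1, 0, 0],
                 [0, 0, 0, 1],
                 [0, 0, 1, 0]], dtype=complex)   # qubit A controls qubit B


def phase_gate(theta):
    """Single-qubit gate diag(1, exp(i theta))."""
    return np.diag([1.0, np.exp(1j * theta)])


def control_phase(phi):
    """Assumed convention for the control-phase gate: diag(1, 1, 1, exp(i phi))."""
    return np.diag([1.0, 1.0, 1.0, np.exp(1j * phi)])


def two_cnot_control_phase(phi):
    """Two CNOTs plus local phase gates; the rightmost factor acts first."""
    half = phase_gate(phi / 2)
    return (np.kron(half, half) @ CNOT
            @ np.kron(I2, phase_gate(-phi / 2)) @ CNOT)


if __name__ == "__main__":
    phi = 0.9
    print(np.allclose(two_cnot_control_phase(phi), control_phase(phi)))
```

The identity behind it is $m + n - (m \oplus n) = 2mn$ for bits $m, n$, so the accumulated phase is $e^{i\varphi mn}$.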

Theorem 2. A two-qubit gate $U \in SU(4)$ can be
decomposed in terms of two CNOT gates and single-qubit
gates $u_i, v_i$,

$U =$ [circuit diagram] (21)

if and only if $h_z = 0$ in its decomposition (4)-(5).
First we prove that if $h_z = 0$, then $U$ can be decomposed
as in (21). Decomposition (4)-(5) becomes



$$U = (u_3' \otimes v_3')\, e^{-iH}\, (u_1' \otimes v_1'), \tag{22}$$

$$H = h_x\, \sigma_x \otimes \sigma_x + h_y\, \sigma_y \otimes \sigma_y, \qquad h_x \geq h_y \geq 0. \tag{23}$$

One can now express $e^{-iH}$ as

[circuit diagram] (24)

where

$u_2 = e^{-i \ldots}$, $v_2 = e^{\ldots}$, (25)

and $u_i, u_i', v_i, v_i'$ in (21) and (22) are related through

$$u_1 = w^{\dagger} u_1', \qquad v_1 = w\, v_1', \qquad u_3 = u_3'\, w, \qquad v_3 = v_3'\, w^{\dagger}. \tag{26}$$

The validity of circuit (24) can be checked by reasoning
as we did in the proof of theorem 1. In particular,
the first local gates $w^{\dagger}_A \otimes w_B$ permute vectors $|\gamma_{10}\rangle$
and $|\gamma_{11}\rangle$. Then the leftmost CNOT gate maps the Bell
basis $|\gamma_{mn}\rangle$ into the product basis $|\bar{m}\rangle_A \otimes |n\rangle_B$. Gates
$u_2 \otimes v_2$ introduce convenient phases $\phi_{mn}(h_x, h_y)$. Finally,
the rightmost CNOT and $w_A \otimes w^{\dagger}_B$ gates map the local
basis back into the original Bell basis.

In order to show that $h_z = 0$ for any gate that can be
decomposed in terms of only two CNOT gates and local
gates, we assume the most general sequence of local gates
and two CNOT gates. Up to initial and final local gates,
this is equal to the quantum circuit on the LHS of (29).
We recall that, in much the same way as an arbitrary
rotation $R \in SO(3)$ can be decomposed as three rotations
along, say, axes z, x and z, gates $u, v \in SU(2)$ can always
be expanded as

$$u = e^{-i a_u \sigma_z}\, e^{-i h_x \sigma_x}\, e^{-i c_u \sigma_z}, \tag{27}$$
$$v = e^{-i a_v \sigma_x}\, e^{-i h_y \sigma_z}\, e^{-i c_v \sigma_x}, \tag{28}$$

for some values of $a_u, h_x, c_u, a_v, h_y, c_v \in \mathbb{R}$. The leftmost
and rightmost exponentials in Eqs. (27)-(28) commute
with the contiguous CNOT gates on the LHS of (29),
leading to the equivalence, up to local unitary transformations
(LU), expressed by

[circuit diagram] (29)

But these circuits are also locally equivalent to that in
(24), which corresponds to a gate $U \in SU(4)$ with $h_z = 0$.
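The sufficiency part of theorem 2 can also be checked numerically. The sketch below (ours, Python with NumPy) builds a two-CNOT circuit with the structure described in the proof: local gates $w^{\dagger} \otimes w$, a CNOT, a round of local phases, a second CNOT, and local gates $w \otimes w^{\dagger}$. The explicit forms chosen for `w`, `u2` and `v2` are assumptions made for this example, including the sign conventions, and need not match the original equations. With these choices the circuit reproduces $e^{-iH}$ for $H = h_x \sigma_x \otimes \sigma_x + h_y \sigma_y \otimes \sigma_y$ exactly.

```python
import numpy as np

X = np.array([[0, 1], [1, 0]], dtype=complex)
Y = np.array([[0, -1j], [1j, 0]], dtype=complex)
Z = np.array([[1, 0], [0, -1]], dtype=complex)
CNOT = np.array([[1, 0, 0, 0],
                 [0, 1, 0, 0],
                 [0, 0, 0, 1],
                 [0, 0, 1, 0]], dtype=complex)   # qubit A controls qubit B


def rot(P, theta):
    """exp(-i theta P) for any matrix P that squares to the identity."""
    dim = P.shape[0]
    return np.cos(theta) * np.eye(dim, dtype=complex) - 1j * np.sin(theta) * P


def exp_minus_iH(hx, hy):
    """exp(-i (hx XX + hy YY)); XX and YY commute."""
    return rot(np.kron(X, X), hx) @ rot(np.kron(Y, Y), hy)


def two_cnot_circuit(hx, hy):
    """Illustrative two-CNOT circuit for the case h_z = 0 (assumed gates)."""
    w = rot(X, np.pi / 4)        # assumed form, w = exp(-i pi/4 X)
    wd = w.conj().T
    u2 = rot(X, hx)              # local phases depending on hx (qubit A)
    v2 = rot(Z, -hy)             # local phases depending on hy (qubit B)
    # Matrix products act right to left: the rightmost factor is applied first.
    return (np.kron(w, wd) @ CNOT @ np.kron(u2, v2) @ CNOT
            @ np.kron(wd, w))


if __name__ == "__main__":
    print(np.allclose(two_cnot_circuit(0.6, 0.25), exp_minus_iH(0.6, 0.25)))
```

The check relies on two facts: conjugation by CNOT maps $\sigma_x$ on the control to $\sigma_x \otimes \sigma_x$ and $\sigma_z$ on the target to $\sigma_z \otimes \sigma_z$, and the outer local gates rotate $\sigma_z \otimes \sigma_z$ into $-\sigma_y \otimes \sigma_y$ while leaving $\sigma_x \otimes \sigma_x$ unchanged.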
[11] In Eq. (15) we have neglected a physically irrelevant
global phase, which originates in the fact that $\det(U_{\rm CNOT}) = -1$
in Eq. (1), i.e. $U_{\rm CNOT} \notin SU(4)$.



